Glissando: a corpus for multidisciplinary prosodic studies in Spanish and Catalan

نویسندگان

  • Juan María Garrido
  • David Escudero Mancebo
  • Lourdes Aguilar
  • Valentín Cardeñoso-Payo
  • Emma Rodero
  • Carme de la Mota
  • César González Ferreras
  • Carlos Vivaracho-Pascual
  • Silvia Rustullet
  • Olatz Larrea
  • Yesika Laplaza
  • Francisco Vizcaíno
  • Eva Estebas-Vilaplana
  • Mercedes Cabrera
  • Antonio Bonafonte
چکیده

Literature review on prosody reveals the lack of corpora for prosodic studies in Catalan and Spanish. In this paper, we present a corpus intended to fill this gap. The corpus comprises two distinct data-sets, a news subcorpus and a dialogue subcorpus, the latter containing either conversational or task-oriented speech. More than 25 h were recorded by twenty eight speakers per language. Among these speakers, eight were professional (four radio news broadcasters and four advertising actors). The entire material presented here has been transcribed, aligned with the J. M. Garrido S. Rustullet Y. Laplaza Computational Linguistics Group (GLiCom), Department of Translation and Language Sciences, Universitat Pompeu Fabra, Barcelona, Spain J. M. Garrido e-mail: [email protected] D. Escudero (&) V. Cardeñoso C. González C. Vivaracho Department of Computer Sciences, Universidad de Valladolid, Valladolid, Spain e-mail: [email protected] L. Aguilar C. de-la-Mota Department of Spanish Philology, Universitat Autònoma de Barcelona, Barcelona, Spain E. Rodero O. Larrea Department of Communication, Universitat Pompeu Fabra, Barcelona, Spain F. Vizcaı́no M. Cabrera Department of Modern Languages, Universidad de las Palmas de Gran Canaria, Las Palmas de Gran Canaria, Spain E. Estebas Department of Modern Languages, Universidad Nacional de Educación a Distancia, Madrid, Spain A. Bonafonte Department of Signal Theory and Communications, Universitat Politècnica de Catalunya, Barcelona, Spain 123 Lang Resources & Evaluation (2013) 47:945–971 DOI 10.1007/s10579-012-9213-0

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Word prosody in early child Catalan, Spanish and English

The goal of this study is to examine the acquisition of prosody at the word level in early child Catalan, Spanish and English. We used a controlled naming task to elicit speech from 36 children; 12 English, 12 Catalan, and 12 Spanish, aged 2, 4 and 6 in order to analyze the acquisition of prosodic words with increasingly complex forms (S, WS, SW, WSW, SWW, WWS, SWSW; 3 target words per prosodic...

متن کامل

On the use of a fuzzy classifier to speed up the Sp_ToBI labeling of the Glissando Spanish corpus

In this paper, we present the application of a novel automatic prosodic labeling methodology for speeding up the manual labeling of the Glissando corpus (Spanish read news items). The methodology is based on the use of soft classification techniques. The output of the automatic system consists on a set of label candidates per word. The number of predicted candidates depends on the degree of cer...

متن کامل

SegProso: A Praat-Based Tool for the Automatic Detection and Annotation of Prosodic Boundaries in Speech Corpora

In this paper we describe SegProso, a Praat-based tool for the automatic segmentation in prosodic units of speech corpora. It is made up of a set of Praat scripts that add several tiers, each one containing the segmentation of a different unit, to a previously existing TextGrid file including the phonetic segmentation of the associated wav file. It has been successfully used for the annotation ...

متن کامل

The relevance of metrical information in early prosodic word acquisition: a comparison of Catalan and Spanish.

This paper focuses on the development of Prosodic Word shapes in Catalan, a language which differs from both Spanish and English in the distribution of PW structures. Of particular interest are the truncations of initial unstressed syllables, and how these develop over time. Developmental qualitative and quantitative data from seven Catalan-speaking children reveal that maximality constraints a...

متن کامل

The relevance of prosodic structure in tonal articulation Edge effects at the prosodic word level in Catalan and Spanish

A production experiment with 1600 potentially ambiguous utterances distinguished by word boundary location in Catalan and Spanish (e.g., Cat. mirà batalles ‘(s)he looked at battles’ vs. mirava talles ‘I/(s)he used to look at carvings’; Span. da balazos ‘(s)he fires shots’ vs. daba lazos ‘I/(s)he gave ribbons’; stressed syllables are underlined) was undertaken. Results revealed strong (and stati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Language Resources and Evaluation

دوره 47  شماره 

صفحات  -

تاریخ انتشار 2013